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Abstract 

We review a minimum set of notions from our previous paper on 
structural properties of SAT |5j that will allow us to define and discuss 
the "complete internal independence" of a decision problem. This 
property is strictly stronger than the independence property that was 
called "strong internal independence" in [5]. 

Next, we show that SAT exhibits this property. We argue that this 
form of independence of a decision problem is the strongest possible 
for a problem. 

By relying upon this maximally strong form of internal indepen- 
dence, we reformulate in more strict terms the informal remarks on 
the exponentiality of SAT that concluded our previous paper [H]. The 
net result of that reformulation is a hint for a proof for SAT being ex- 
ponential. We conjecture that a full fledged proof of that proposition 
can be obtained by strictly following the line of that hint of proof. 

1 Introduction 

In a previous paper [5J we used strings (borrowed from Computability [9J) 
in order to formulate various notions of internal independence of a decision 
problem. We proved that SAT exhibits a structural property that we called 
"strong internal independence." We could then prove various results on SAT 
using that property. 

In this paper we prove that SAT exhibits a strictly stronger form of 
structural independence, that we call "complete internal independence." We 
argue that this form of independence of a decision problem is the strongest 
possible for a problem. Using this stronger property, we will be able to 
give more explicit form to the informal remarks on the time complexity of 
SAT that concluded our previous paper. As was the case for that paper. 
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these remarks are only intended to suggest that there is relation between the 
structure of the kernel of a decision program P (as defined in cited paper) 
and the time complexity of P. 

To prove this stronger property of SAT we do not need the full machinery 
of concepts and techniques that were developed in [5] (these were used to 
prove that SAT has no wizards, and that all programs that solve SAT have 
same kernel). So, we shall take advantage of our present simplified context, 
and will offer a development of the most elementary aspects of the theory 
at an introductory level. We shall also take occasion to put in due evidence 
the roots of the theory: We take ideas from previous work of Scott [TT], and 
Larsen and Winskel |8], and use results of Di Zenzo, Bottoni, Mussio [1]. 

2 Introductory Remarks on Strings 

We define a string to be a partial function f : N J] having finite domain. 

is the set of the positive integers and S is an alphabet which includes 
and 1. In simple words, a string g being included (or subsumed) in a word 
X is that which remains of that word if we cancel out zero or more letters, 
while leaving blanks in places of letters. We define Soo to be the set of all 
strings over S. 

We regard prescription that a word x over S may or may 

not satisfy. If word x includes string g, we think that x satisfies prescription 
g. Two strings are compatible if they prescribe same values to arguments in 
the intersection of their domains. Otherwise they are incompatible. 

If Dom{f) is an initial segment of N, Dom{f) = {1, ..,"«.} for some posi- 
tive integer n, then we say that / is a full string or word of length n over S. 
Thus, words are certain special strings. It is desirable to have a reserved name 
for describing the new mathematical objects, and we consider that "string" 
is the appropriate one: It is required that one remembers that "string" and 
"word" have different meanings. 

The new objects, the strings, can be viewed as generalized words which 
can undergo splittings and rejoinings according with more flexible schemes 
as compared with words. The strings actually occur as generalized words: 
They represent fragments of words, and we think of them as partial words. 

There is a duality between strings and words, and one of the challenges 
of this theory is to formalize this duality in a useful manner. This is the 
purpose of the Galois connection in [S] . In this paper we do not need the full 
machinery of that connection, and we will just use some of the results. The 
important result is that whenever we have a set A of words, we can set up 
a second set Log (A) (or LogE^A) in a relativized theory) that one may wish 
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to regard as jinnee or genie of given set A. The new set, the logogram of 
A, is a set of strings: the genes (so to speak) of the words in A. 

When we deal with strings we think of a game in which one reconstructs 
words from strings. Equivalently, the game is to classify a word by matching 
it with strings taken from some vaste repository of strings. There are various 
basic operations that we can perform on strings. These include (i) to join 
strings to form new strings and (ii) to extend a string to form a word. These 
seem to be the very basic operations on strings. 

If we join two incompatible strings we just get the void set 0, which is to 
say that output vanishes. If we join two compatible strings we obtain a new 
string which encapsulates each of the given strings. 

The join of two compatible strings f,g is a new string that we denote 
f + g. In order that the operation of join may be applicable to any pair of 
strings, we shall also define the join of g and g. This we take to be g itself. 
Thus join is idempotent. 

Further, in order to be able to iterate the operation of join we must define 
joins of sets of strings. This is an important step in our theory, and yields 
surprisingly reach algebraic structures. We define the join of two nonempty 
sets of strings H, K to be the set of all strings formed by joining one string 
from H with one string from K and aggregating all the joins formed in this 
way in a new set of strings H + K. The move of focus from join of single 
strings to that of whole sets of strings opens wide horizons. 

3 Introductory Remarks on Independence 

In this paper wc make extensive use of the strings and of their associated 
sets, the cylinder sets. As usual in various fields of application, it can be 
convenient for us to speak of special classes of sets (the cylinder sets, in this 
case) as being events: This will also make the recourse to arguments using 
notions of dependence/independence as natural as possible. In a sense, it 
suggests these arguments. 

To every event A there corresponds a contrary event "not A," to be de- 
noted A^. Event A^ occurs if and only if A does not occur. Always remember 
that we are considering words and sets of words, and the whole set is S*. 
An event may imply another event: A implies B if, when A occurs, then B 
necessarily occurs. If A implies B and also B implies A, then we conclude 
that A and B are one and the same event. Events are combined into new 
events by means of operations expressed by and, or and not. The event "A 
and 5" is denoted A{~\ B, or simply AB. It occurs if and only if both the 
event A and the event B occur. AB cannot occur if AB — 0. This is the 
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case when, if A occurs then B does not occur, and if B occurs then A does 
not occur. When AB = we say that A and B are incompatible. Event "A 
or is denoted AU B. 

Let us have a more detailed look at incompatibility between events. As- 
sume that both A and B are nonempty and incompatible. Then AB = 0. 
Then we can rely upon the following three circumstances (i) That there is a 
word X which is in A but not in B, (ii) That there is a word y which is in B 
but not in A, (iii) That there are no words z being both in A and in B. It 
is not guaranteed that there exists a word w which is neither in A nor in B. 
For this fourth circumstance to hold the additional requirement that AU B 
shall not be coincident with the totality of all words S* should be satisfied. 

4 Introductory remarks on NP relations 

The applications that we have on agenda deal with certain structural prop- 
erties that a decision problem can exhibit. When present, these properties 
express internal independence of the problem. To correctly formulate these 
properties, we need to refer to a decision problem as a pair {E, F) of sets, 
the second included in the first. Indeed, the properties that we establish are 
about E, F, and how F is embedded in E. 

Our main objective consists of the decision applications, in particular 
the application to SAT. In this paper (as in companion paper [5]) we set 
forth new techniques to attack the decision problems. We consider a generic 
decision problem 11 and assume that the instances of 11 are encoded as words 
over some fixed problem alphabet S. We call E the set of those words that 
encode instances of 11. Obviously, i? is a recursive set, and we assume it is 
infinite. In any decision problem 11 there is a second recursive set F being 
a proper subset of E. The words in F encode those instances of 11 that 
we intend to recognize from all of the remaining instances. We refer to this 
encoded decision problem as problem {E,F). 

Thus, we define an (encoded) decision problem to be a pair {E, F) of 
recursive sets of words over an alphabet E where F, the target set of the 
decision, is a subset of the reference set E. For example, in the problem 
of deciding satisfiability of boolean formulas, E = CNF is the set of the 
strings that encode formulas in conjunctive normal form, and F = SAT is 
the subset of the satisfiable formulas. 

Relations over an alphabet Let G be a subset of the Cartesian product 
S* X S* so that G is a relation on words over S. There exist exactly one 
set A and exactly one set B with the properties (i) a: G A is equivalent to 
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relation 3yG{x,y), (ii) y E B is equivalent to relation 3xG{x,y). These sets 
are first and second projection of relation G. The first projection will be 
denoted Dom^G), the second is Cod{G). 

NP relations We say G is decidable in polynomial time if there exists a 
deterministic algorithm which decides membership in G of any pair (x, y) 
in time polynomial in the length |x| of the input instance x. We say that 
G is polynomially balanced or else polynomially bounded if there exists a 
polynomial p such that (x, y) in G implies \y\ < p{\x\). 

A relation G which is both polynomial-time decidable and polynomially 
balanced is an NP relation [10]. A language L is in NP iff there exists an 
NP relation G such that L = Dom{G). In short, L is in NP iff L is the first 
projection of some NP relation. When this happens, we say G is a defining 
relation for L. Note that a language L being in NP may have more than one 
defining relations, which is to say L can be first projection of more than one 
NP relations. 

With each NP relation G C E* x S* we associate the following search 
problem: Given x find y such that G{x, y) or state that no such y exists. 

P relations Let G be an NP relation. Thus G is polynomially bounded, 
and there is a search problem associated with G. We say that G is a P relation 
iff there exists a polynomial-time algorithm that given x finds y such that 
G(x, y) or states that no such y exists. Equivalently, we say that G is a P 
relation if G is an NP relation and, besides, the search problem associated 
with G is solvable in polynomial time. Let L be a language in NP. Then 
there exists at least one NP relation G which defines L. Language L is in P 
if and only if at least one of the NP relations whose first projection is L is 
actually a P relation. 

Standard characterization of NP Let LI be a problem in NP, encoded 
as a pair {E, F) over some problem alphabet S. In the standard characteri- 
zation of class NP, there exists a sequence yi,?/2, .. of specials words that are 
called the solutions of problem {E, F). In general, given any generic problem 
instance x in E, we have that x is "satisfied" by certain solutions. There 
are also instances x which fail to be satisfied by any solutions, and we call 
them unsatisfiable. What "satisfaction" means operationally is proper of the 
problem {E, F) under study. 

With any particular NP problem [E, F) one associates a recursive func- 
tion a{n) such that the solutions that can possibly satisfy an instance x such 
that |x| = n are all comprised between yi and ya{n)- 
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Associated with solutions yi, 2/2 ,•• there is a decomposition of target set 
F into subsets Fj called solution regions, where Fj is the set of those x's 
that are satisfied by yi. The obvious relation F = (Ji -^i holds. 

5 Alphabets, strings, and words 

By an alphabet S we mean a finite set of elements called symbols. Any finite 
sequence of symbols from S is called a word over E. The size (or length) of a 
word w, noted \w\, is the number of symbols composing the word. The size 
of a string equals the maximum number in its domain (strings are partial 
functions, hence every string has a domain). For strings that are words the 
two numbers coincide. 

In this paper "string" and "word" are not synonimous. We borrowed 
strings from Computability, where a string over E is a partial function g : 
TV — > {0, 1} with finite domain. If the domain of string g is an initial segment 
of N then g is a word. 

We write S" for the set of all words of length exactly n over E while E* 
is the set of all words over E as usual, thus E* = Un 

Echelons of strings We define E„ to be the set of all strings of size at 
most n over the alphabet E. We say that E„ is an echelon of strings over E. 

The empty string _L is a member of the set Eqo of all strings over E. Its 
length is 0. The empty string ± belongs to all echelons of strings. The meet 
of any two strings in T,^ is a string in E„. The join of any two compatible 
strings in E„ is a string in E„. 

Finite character of languages Let S be any set and S any class of subsets 
of S. We say that S (as well as its members) are of finite character if there 
exists a class JF of finite subsets of S such that any A ^ S is univocally 
determined from its intersections T n A with the elements T of class JF. 
We take the class jC — >C(E) of all languages over E as class E and set 
— IE" : n e N}. We then have that any language L G £ is univocally 
determined by its intersections E" fl L with the members of class JF (it is also 
interesting to note that these intersections are disjoint taken two by two). 
By way of consequence, L is of finite character. 

Note that the union and the intersection of two languages reduces to the 
union and the intersection of their respective echelons. More specifically, if 
A and B are sets of words over some alphabet E, then we may well form 
the union AU B. However this is actually done echelon by echelon, namely 
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AU B = U„(^ U S)" = U„(^" U 5"). Analogously for the intersection 

AnB = u„(A n BY' = u„(A" n 5"). 

It will scarcely be the case that we consider sets encompassing some of 
the echelons of a language L. In our arguments, either all of the echelons 
are considered simultaneously (when we are reasoning at large, that is in 
terms of infinite sets) or else only one of the echelons is under focus (when 
we reason echelon by echelon). We may well put under focus the union of 
all the echelons, which is the whole of set L (when we go at large) or else 
we may take one echelon at a time. In many cases going echelon by echelon 
is convenient, e.g. when we deal with nonuniform circuit classes. In our 
previous paper [5] we considered single echelons in order to derive our results 
about SAT. 

The finitary character of languages will be of help in our study. It will 
make it possible to break various infinitary statements down to finitary ones. 
Besides, it will make it possible for us to regard an infinite set of words as 
the disjoint union of its echelons (which are finite sets). Note that the finite 
character of languages manifests itself as algebraicity of the closure operators 
associated with the Galois connection in [3]. 

Basic properties of strings In this paragraph we establish various tech- 
nical concepts regarding the objects in Sqo namely the strings over S. 

Set Eoo is partially ordered in a straightforward way. Given any pair of 
strings f,g E Soo, we say g is an extension of / (written f < g or g > f) 
as soon as Dom{f) C Dom{g) and g takes exactly the same values as / in 
Dom{f). If f < g and g < f then f = g. When f < g but not f > g, we 
write f < g and say that g is a. proper extension of /, or, equivalently, / is 
a proper substring of g. 

We define _L to be the null partial function N ^ H. Considered as a 
prescription, ± is satisfied by all words a; G S*. Note that Dom{l.) = 0. 
We say _L is the void (or null) element of set Soo- Any element in is an 
extension of the bottom element ±. Thus, the ordered structure (Sqo, <) has 
a least element ±. 

In addition to order, Sqo is equipped with a relation of compatibility 
(or consistency). Two elements / and g of Sqo are compatible as soon as 
f{x) = g{x) for X in Dom{f)r\Dom{g). If we regard / and g as prescriptions, 
we say that they are compatible as soon as it is not the case that they assign 
different values to one and the same entry of a word. If /, g are disjoint, which 
is to say Dom{f) fl Dom{g) = 0, then / and g are certainly compatible. We 
call /, g incompatible as soon as they fail to be compatible. 

Given any pair of compatible strings /, g, we define their join / + 5' to be 
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the least string which is an extension of both / and g. Thus f,g < f + g 
and Dom{f + g) is exactly Dom[f) U Dom{g). Join is the most pecuhar 
operation that we can perform with two strings as operands. The join of two 
compatible strings a, 6 is a new string which subsumes both a and h. The 
join of two incompatible strings is the void set of strings. 

To be able to iterate the operation of join we define the join of two sets 
of strings. Thus we define the join of the nonempty sets of strings H, K to 
be the set of strings formed by joining each string of H with each string of 
K and aggregating all the joins formed in this way in one set. Thus, the join 
of H and K is the set of strings H + K = {a + b: aEH,b^ K}. 

For any arbitrary subset H of Soo we define if + = + i^f = 0. 

The meet f /\ g oi any two strings is the restriction of / (or g) to that 
portion of the intersection Dom{f) fl Dom{g) where / and g agree. Thus 
f,gl^ f /\ 9 and Dom{f A g) is included in Dom{f) fl Dom{g). 

Consistent Sets of Strings Let H be any subset of the space Sqo. We 
say that H is reduced as soon as no string g & H is properly included in 
another string f ^ H. Given any set of strings H, we denote by notation \H\ 
the set of all those strings in H that do not properly include other strings in 
H. It is immediately verified that H and \H\ are isoexpansive. 

A set of strings H is consistent as soon as there exists a word x which 
includes every string in H. Since any word x includes a finite number of 
distinct strings, all the consistent sets of strings are finite. If word x includes 
all of the strings in H then we say that H is consistent by virtue of word x. 

It can be shown that a finite set of strings H is consistent as soon as the 
strings in it are compatible taken two by two. Note that there exist infinite 
sets of strings whose strings are compatible taken two by two. However these 
sets of strings are not consistent even though the strings in it are compatible 
taken two by two. A set of strings is consistent if and only if there exists a 
word which includes all of the strings in the set. 

6 Generalized Certificates 

In companion paper [3] we generalized the notion of certificate of membership 
of standard theory of NP relations. Our generalization consists in identifying 
strings as the appropriate mathematical entities suited for representing the 
certificates of membership. The idea is as follows: An input word x, whose 
membership in the reference set E has already been ascertained, actually 
belongs in the target set F as soon as it includes certain special strings that 
are characteristic of those words of set E that happen to be satisfiable. So to 
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speak, we assume that satisfiability is accompanied by signs: The observable 
signs of satisfiability. We assume that the signs are interspersed within the 
object. 

(As the Irish passerby is manifestly the bearer of signs of Irishmanship - 
otherwise I would't be able to recognize him as an Irishman among hundreds 
of Englishmen alongstreet in London - so shall the satisfiable formulas carry 
the signs of satisfiabilty.) 

We assume that, for individuals that are words over an alphabet, the signs 
can only consist of included strings. Our main object in present research 
effort is to construct the theory that will provide us with the set of the 
strings that characterize the satisfiable instances of an NP problem. What 
we need is the set of the signs. We will call it the logogram of the target set 
F relative to the base E as it comes up as a collection of logos. This exercise 
is the mathematical core of companion paper [5]. 

7 Connection between Strings and Words 

In this section we offer an overview of the mathematics through which we 
attempted the construction of the "set of the signs" in 0. 

To every recursive set E over alphabet E we associate the set Soo(-E), 
which is that subset of Sqo which contains all strings that occur in words of 
the reference set E. Thus, we define 

Soo(^) = {(? G : (3x G E)g < x}. (1) 

This is the set of all those strings g in Sqo whose associated cylinder Exp{g) 
intersects E: 

^^{E) = {ge^oo:Exp{g)nE^^. (2) 

Thus, Soo(-E') is the set of those strings in Sqo whose associated cylinder 
contains elements of set E. It is understood that E is the set of words over 
S that encode instances of some fixed reference computational problem 11. 
(Whenever we talk of a reference set E there is an implicit reference to some 
fixed computational problem 11 as well as to a program P solving 11.) 

In [5] we have shown that there is a Galois connection (in the original 
sense given in [3]) between the subsets of Soo(-E) and the subsets of E. The 
connection induces two closure systems: On 'Lao{E) on one side (convention- 
ally the left) and on E on the other (the right). 

The (involutory) isomorphism associated with the connection is a one-one 
onto correspondence between closed subsets F C on one side and closed 
subsets H C Eoo(-E') on the other. A convenient notation for the image of 
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set F C £■ under this isomorphism is LogE{F), and this suggests notation 
for the image of if C Soo(-E') through the inverse isomorphism. From 
the general theory of the Gafois connection, is then the set of those x in 
E which include strings from H. We then have 



holding for closed sets F ^ E and H C J]^[E). 

On the right side of the connection, the subsets F of E which happen to 
be closed are those that are relative cylinders in E. We understand that a 
subset F of i? is a relative cylinder in E as soon as there exists H C Soo(i?) 
such that F = E" = Expe{H). 

It is immediately seen that our notion of a cylinder set is an extension of 
the ordinary notion of a cylinder set as defined e.g. in |9]. Our notion of a 
cylinder reduces to the ordinary notion as soon as we confine to strings that 
are words. Besides, those subsets F of the reference set E which are cylinders 
in the ordinary sense can easily be shown also to be relative cylinders with 
respect to E. 

In present paper we will exhibit an appropriate encoding for SAT in 
which the reference set E is prefix-free. Everything becomes easier under 
this encoding: All subsets of E happen to be closed in E, and our theory 
becomes self-contained (there is no longer need to invoke cylindricity of SAT 
as proved e.g. in ^). 

Absolute cylinders Given g G Sqo, we define the (absolute) cylinder as- 
sociated with g to be the set 



Thus, Exp{g) is the set of those words over S which include g. Exp{g) is 
the absolute expansion of string g. The two notations Exp{g) and (S*)^ 
are interchangeable (we find it convenient to keep both). We also say that 
Exp{g) is elementary to mark difference with the nonelementary cylinders 
to be introduced below. We call g the signature of set A = Exp{g). 

Now assume that, as special case, x is a word over S. Then Exp{x) 
is the set of all words over S which exhibit prefix x. For a; = _L we have 
Exp{l.) = (S*)-*- = S*. Thus, S* is itself an elementary cylinder set: Its 
signature is _L. 

As next step, we define the general cylinder sets (just cylinders, not nec- 
essarily elementary). First, we define the expansion of a set of strings: Let 
H C Eoo be any set of strings over S. We define Exp{H) to be the set 



^Log^F ^ F^LogE^E") = H 



(3) 



Exp{g) = (S*)f = {s eJ:* : s> g} 



(4) 



Exp{H) = (S*) 



{x e S* : (3^ e H)x > g}. 



(5) 
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Thus Exp{H) is defined to be the set of all words in E* which subsume 
strings from H. That given, a set A C E* is defined to be a cylinder as soon 
as there is if C Eoo such that A — Exp{H). 

Note that the empty set of words is a cylinder since (E*)'^ = and is a 
subset of Eoo- However, is not an elementary cylinder. 

The last notion that we give in this list of notions is cylindrification. 
By this, we understand expansion restricted to sets of words: Given ^4 C E*, 
the cyhndrification of A is Exp{A) — (E*)"^. By definition, this is the set of 
all words in E* which include words from A. Now a word is a string whose 
domain consists in an initial segment of A^. Thus, a word y subsumes another 
word X if and only if a; is a prefix of y. Thus, Exp{A) = words in E* which 
exhibit a word from A as prefix. 

Theorem 1. Exp is a closure operation in E*. By the way, a topological 
one. 

Proof. (I) Every word in E* is a prefix of itself, hence every word in A has a 
word from A as prefix. Thus, A C Exp{A). 

(II) Exp{Exp{A)) — Exp{A). Indeed, if a word z belongs to first member, 
then z has a prefix y e Exp{A). Then y has a prefix x & A. We conclude 
that X is a prefix of z, hence z belongs in the second member. Equality of 
the two members follows from Exp{A) C Exp{Exp{A)) by (I). 

(III) Let Ac B CT,*. If any word x has a word from A as prefix then x 
has a prefix from set B also. 

This yields Exp{A) C Exp{B). 

This completes the proof that Exp is a closure operation in E*. 

(IV) Let A, B be any pair of sets of words in E*. Then Exp{A U B) = 
words over E that either have a prefix from A or from B. 

Clearly Exp{A U S) = Exp{A) U Exp{B). Thus, Exp is topological. □ 

Theorem 2. A C T,* is a cylinder set ijf Exp{A) = A. 

Proof. (1) Let A be a cylinder set. Then there is if C Eoo such that 
A — Exp{H). By taking the expansion of both sides we get Exp{A) — 
Exp{Exp{H)). 

We show that Exp{Exp{H)) = Exp{H). 

Indeed, let 2; be a word belonging to first member. Then z includes a 
word y G Exp{H). Being a member of Exp{H), y includes a string g from 
H. Then z includes g. Since z includes a string from if, it belongs to 
Exp{H). 

Thus, Exp{Exp{H)) C Exp{H). 

By Theorem 1, Exp restricted to E* is a closure operator. Hence Exp{H) C 
Exp{Exp{H)). 
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By way of consequence, Exp{Exp{H)) = Exp{H). 
Since A — Exp{H), we conclude that Exp{A) = A. 
(II) Let Exp{A) = A. 

Words are certain special strings. Thus, we may well affirm that there is 
a set of strings A such that A = Exp{A). Then A is an absolute cylinder. □ 

Theorem 3. (I) The intersection of two cylinder sets is a cylinder set, (II) 
The union of two cylinder sets is a cylinder set. 

Proof. (I) Let A, B be cyhnder sets. Since A is a cylinder, there exists 
H C Soo such that A = Exp{H) — (E*)^. Analogously, there exist K C Eoo 
such that B = Exp{K) = (S*)^. 

It follows that AnB= (E*)^ n (E*)^. 

The latter is the set of all words over alphabet E each of which includes 
at least one string from H and at least one from K. 

Let x be any word over alphabet E. It is easily seen that x includes 
a string from H and one from K if and only if x includes their join, thus 
(E*)^ n (E*)^ = (E*)^+-^. Since (E*)^+-^ is a cylinder, we conclude that 
the intersection of two cylinder sets is a cylinder set. 

(II) Let A = Exp{H) and B = Exp{K) be cylinder sets. 

Here H,K Cj:^. 

Then AU B = (E*)^ U (E*)^ is the set of all words over E each of which 
includes a string from H U K. 

Thus, AUB = (E*)^ U (E*)^ = (E*)^^^. 

Since (E*)^*-^^ is a cylinder, we conclude that the union of two cylinder 
set is a cylinder. □ 

We conclude this subsection with one further remark on the structure of 
the absolute cylinders. Let A be an absolute cyhnder. Prom Theorem 2, we 

have that A happens to be the same set as its own cylindrification. This is to 
say that A contains, together with any of its words x, all of the words that 
exhibit prefix x. 

Relative cylinders Given H C Eoo(-E'), we define 

Expe{H) ^ E^ = {x e E : {3ae H)x > a] = E f\ Exp{H). (6) 

Thus, E^ is the set of those words in E which contain strings from H. We 
call E^ the expansion of H relative to base E. For £^ = E* we regain the 
absolute expansion of H. 

Given any set ^4 C we define ^1 to be a cylinder in E as soon as there 
is H (Z Eoo(-E) such that A = E" . 
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Explicitly note that E is the intersection between an absolute cylin- 
der Exp{H) and the reference set E. We actually regard E^ as being a 
relativized cylinder, i.e., a cylinder relative to an underlying set E. 

We saw that cylindrification is a closure operation in S*. Besides, we 
proved that A C S* is a cylinder set as soon as Exp{A) = A. We derived 
these results in the absolute case. How do these results reformulate when we 
replace E* with an infinite recursive set of words E7 

Theorem 4. Cylindrification ExpE relative to a reference set E is a closure 
operation in E. By the way, a topological closure. 

Proof. (I) Every word in any set of words i? is a prefix of itself, hence every 
word in A C i? is a word of set E having a word from A as prefix. Thus, it 
belongs to Expe{A). Thus, A C Expe{A). 

(II) Expe{Expe{A)) — Expe{A). Indeed, if word z belongs to first 
member, then z is a word from E having a prefix y G Expe{A). But then 
y has a prefix x & A. Then a; is a prefix of hence z belongs in the 
second member. Equality of the two members follows from Expe{A) C 
Expe{Expe{A)) by (I). 

(III) Let A B C. E. If any word x E E has a word from A as prefix 
then X has a prefix from set B. This yields Expe{A) C Expe{B). 

This completes the proof that ExpE is a closure operation in E. 

(IV) Let A, B be any pair of sets of words in E. Then Expe{A U B) — 
words in E that either have a prefix from A or from B. Clearly Expe{A U 
B) — Expe{A) U Expe{B). Thus, ExpE is topological. □ 

Theorem b. A E is a cylinder in E iff Exp e{ A) = A. 

Proof. Similar to that of Theorem 2. □ 

Thus, Theorems 1 and 2 do hold also in the relative case. By way of 
consequence, given A <0 E, we have that A is a cylinder in E if and only if 
Expe{A) = A. Thus, A is a cylinder in E if and only if A is closed under 
cyhndrification relative to E. 

We conclude this subsection with a few remarks on compatibility and 
relative compatibility of strings. Let f,g be strings in Eoo(£'). Then there 
exist in E words which include / as well as words which include (7. If / and 
g are incompatible, then E certainly does not have words including both / 
and g. However, even if / and g are compatible, it is not necessarily the case 
that E shall have to contain words which simultaneously include both / and 
g. The whole question can well be reformulated as follows. Let us assume / 
and g compatible. Then f + g exists, but it is not mandatory that it belongs 
to Eoo(-E). If E fails to contain a word including both / and g, then / and 
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g are compatible and nevertheless they are inconsistent relative to reference 
set E. When that is the case, the join f + g fails to belong in (-£")• / and 
g compatible is necessary ior f + g to exist in 'E^i^E), but the same condition 
is by no means sufficient to conclude f + g & Eoo(-E'). 

Absolute and relative logograms In this section we first introduce the 
logogram of a set of words relative to a reference set E. Next, we regain the 
absolute logogram for E — T,*. For any F C E, we define 

LogE{F) = {ge : {\/s e E)s > g ^ s e E^"} (7) 

to be the logogram of set F to base E. Note that, since words are strings, 
E^ is defined. E^ is the relative cylindrification of F in E, and this in turn 
(since F is a set of words) is the set of all words in E that are prefixed by 
words in F. Observe that, if F is a cylinder in E, which is to say if F = E^ 
for some H C 'E^(E), then E^ — F hy Theorem 5. 

Besides. Loqe^F) is the set of those strings g whose associated cylinder 
Exp{g) cuts out of E an intersection set Exp{g) (IE which is (i) nonvoid, (ii) 
fully contained in the cylindrification E^ of F relative to E. For E = T,*, 
the above defining equation for the logogram is rewritten 

Log{F) = e Eoo : (Vs e E*)s >g^se Exp{F)} (8) 

This is what we call the absolute logogram of language F. Thus, Log{F) 
is the set of the strings g whose associated absolute cylinder Exp{g) is fully 
contained in Exp{F). 

We conclude this section listing two properties of mapping LogE that are 
used in the sequel. First, given A,BCE, one has: 

ACB^ LogE{A) C LogE{B). (9) 

This is monotonicity of the logogram. The second property is: 

LogE{A U 5) D LogE{A) U LogE{B). (10) 

Let us understand this inclusion. LogE{A U B) is the set of all strings that, 
for X in E, are able to trigger event x G E^^^ = E"^ U E^. A string that 
triggers x G F^ certainly belongs to LogE^AUB). Analogously, a string that 
triggers x G E^ certainly belongs to LogE{A[JB). Thus, LogE{A)\JLogE{B) 
certainly is a subset of LogE{A\J B). However, there can be strings / whose 
inclusion in a word a; G F is a sufficient condition for event x G E^ U E^ but 
not for X G E"^ or a; G E^ . Thus, in the general case LogE{A U B) is not the 
same set as LogE{A) U LogE^B). 
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Theorem 6. Theorem Let H C Soo(i?). The mapping which carries H onto 
Log{Exp{H)) is a closure operator. 

Proof. Proof. Exp{H) is the set of aU words in S* which include strings in 
H. Given any set of words ^4 C E* we have that Log (A) is the set of aU 
strings in E^o such that inclusion in any word in E* of a string from Log{A) 
is sufficient to ensure that that word belongs to cylindrification of A. 

Then Log{Exp{H)) is the set of all strings in Soo whose presence in a word 
from E* would be a guarantee for that word to belong in the cylindrification 
of Exp{H). However, the cylindrification of Exp{H), noted Exp{Exp{H)), 
reduces to just Exp{H). Then, Log{Exp{H)) is the set of all strings in Eo^ 
whose presence in a word from E* would be a guarantee for that word to 
belong in Exp{H). 

But then Log{Exp{H)) is the set of all strings in Eoo whose presence in 
a word a; e E* serve as symptom that word x also includes a string in H. 

(I) H C Log{Exp{H)). Indeed, if x includes a string from H, this is the 
most clear symptom that x includes strings from H . 

(II) Let H C K. Let g belong in Log{Exp{H)). Let x include g, then it 
also includes a string / from H. But f E K, then g is also a symptom for 
presence in x of a string from K. Hence g e Log{Exp{H)). 

(HI) LogExp{H) — LogExp{LogExp{H)). This is actually obvious. □ 

We may call just Log Exp the mapping '^oo which carries H 

onto Log{Exp{H)). Thus, LogExp is a closure operator. This closure is 
not topological. Indeed, it is not always the case that LogExp{H U K) — 
LogExp{H) U LogExp{K). There are cases where LogExp{H U K) contains 
certain special kinds of a string g whose presence in a word behaves as a spy 
for the presence in that word of a string which in some cases is from H and 
in others is from K. Clearly, such a string g couldn't be a member of either 
LogExp{H) or LogExp{K). String g would serve as a collective sympton for 
both strings from H and from K. This is a key point in this research. 

Theorem 7. Given ACE, ExpE{LogE{A)) = E^. 

Proof. (I) Let x G ExpE^LogE^A)) . Then a; is a word in E that includes at 
least a string g e LogE{A). By definition of logogram, x is in E"^. 

(II) Let X e E^. Thus, x is a word in E which includes a string from 

A. Since A C LogE{A), we also have that x is a word in E which includes a 
string from LogE{A). The set of all words in E which include strings from 
LogE^A) is ExpE{LogE{A)). Hence x G ExpE{LogE{A)). □ 

Note that, since LogE{A) and \LogE{A)\ are isoexpansive, we also have 
ExpE\LogE{A)\ =E^. 
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8 Kernel of a Decision Program 



With any NP problem {E,F) we associate a set of strings \LogE{F)\ called 
the reduced logogram of F relative to E, which conveys structural informa- 
tion on E, F, and how F is embedded in E. We assume F to be a relative 
cylinder in E (the rationale for this assumption will be given in Section 12). 

The strings in \LogE{F) \ serve as certificates of membership for F relative 
to E. This means that, limited to words in E, to include one or more strings 
from \LogE{F)\ is necessary and sufficient for membership of a word in F. 

In principle, we cannot exclude that \LogE{F)\ may contain strings that 
behave as collective witnesses, also called wizards. There exist problems, 
e.g. PRIMES, where \LogE{F)\ has wizards. (Note, incidentally, that 
PRIMES is in P [1].) Should that be the case, a program P solving [E, F) 
might do calculations that are functionally equivalent to testing input x for 
wizards: That would make the task of deciding about input easier. 

Let program P solve problem {E,F). The tests in \LogE{F)\ are those 
that P can use: They are so to speak at disposal for a program P. Which of 
these tests are actually used by P is a different story. We define the kernel of 
program P, noted Ker{P), to be the set of the strings from \LogE{F)\ that 
P actually uses for making decisions. The strings in Ker{P) are uniquely 
identified by the algorithm that P implements. The composition of Ker{P) 
in terms of strings can also be determined through experiments with the 
executable of P. 

Let if be a subset of the reduced logogram \LogE{F)\ of F relative to 
E. We call H complete for problem [E, F) as soon as, for any x E E, 
we have that x actually belongs to F if and only if x includes at least one 
string g from H. If no proper subset of the reduced logogram \LogE{F)\ 
happens to be complete for {E, F) then we say that \LogE{F) \ is irreducible. 
Since Ker{P) is a subset of the reduced logogram, when this last happens to 
be irreducible we have that any two programs that solve {E, F) have same 
program kernel, written Ker{P) = Ker{Q). This by no means implies that 
P and Q shall have to exhibit equal time complexities, however we can make 
informal remarks that suggest that further investigation of the matter might 
be worthwhile in this case. 

In companion paper we proved that, for E = CNF, F = SAT, problem 
{E, F) cannot have collective certificates in its reduced logogram, and this 
in turn yielded that its reduced kernel \LogE{F)\ is irreducible. We derived 
these results from a property of structural independence of SAT that we 
called "strong internal independence." 

In what follows we prove that SAT exhibits a strictly stronger form of 
structural independence, that we call "complete internal independence." We 
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argue that this form of independence of a decision problem is the strongest 
possible for a decision problem. Using this stronger property, we will be able 
to give more explicit form to the informal remarks that we made in |5] on 
the time complexity of SAT. 

9 Complete Independence of events 

We consider a decision problem {E,F). We already noted that, if g E 
\LogE{F)\, then ExpE^g) = Exp{g)r\E is fully included in F. Thus, ExpE^g) 
is an elementary cylinder relative to E, and is fully included in F. The tar- 
get F of problem {E, F) is covered with sets ExpE^g) where g G \LogE{F)\. 
Statements such as a; G ExpE^g) with g G \LogE{F)\ will be called "events 
in the universe -F," and we can rely upon the fact that both the union and 
the intersection of two events in universe F are events in universe F. 

Here the events in the universe F are subsets of F being closed in E. We 
are using "event" for subsets of E of the form E^ where H C Soo(-E')- 

Thus, given any decision problem {E,F), we introduce the cover of the 
target set F associated with \LogE{F)\ to be the family of sets 



Its members are the regions of the cover. The cover that is associated with 
the kernel of a program P solving {E, F) is then 



Both Ve{F) and J-'p{E, F) are collections of subsets of the target set F whose 
union is F, with J-'p{E, F) being a subcoUection of Ve{F). 

We are now in good position to define a notion of complete independence 
of a finite set of events in the universe F. By a partition of the set F we 
understand a collection of events within universe F being pairwise incom- 
patible and exhaustive. Two events in universe F are incompatible as soon 
as the corresponding sets are disjoint. By a collection of events in universe 
F being exhaustive we understand that, if input x is in F, then at least one 
of them will occur. 

Consider a finite set of events Ei, E2, .., Em- We will define a notion of 
complete independence among them. We will reduce the general case to the 
simple case of a partition. To this end, we take under consideration the 2"^ 
products UiU2--Um where Ui can be either E^ or its complement E^ = F — Ei. 
We may obtain the 2™ formal products by developing the form 



Ve{F) = {ExpEig) ^F:ge \LogE{F)\}. 



(11) 



^p(F,F) = {ExpE{g) <^F:ge Ker{P)}. 



(12) 



{E, + E'^){E2 + E^)..{Em + E^J 
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Here product means intersection and we use sum for union to stress that 
it is disjoint. Some of the products can be void, and we do not take 
care of them. Those that are nonvoid take the name of atomic constituents 
Ci, C2, .., Cs of the partition induced by events Ei, E2, .., Em where s < 2™. 

We say that m events Ei, E2, .., Em are completely independent as soon as 
they give rise to 2"^ nonvoid atomic constituents. If m events are completely 
independent, then every one of them remains uncertain (we do not know if 
it happened) even if we are notified the outcome of each of the other m — 1. 

10 Witnesses and Wizards 

The computations that a decision program P performs on an input x are 
functionally equivalent to sequences of tests done on x. The tests that P 
can perform on an input, hence those that can occur in one such sequence of 
tests, are those that search the input word for strings g e Ker{P). This is a 
straight consequence of how we defined Ker{P). We actually assume that P 
can only do calculations that encode tests on x that belong to this collection 
of tests. 

Searching x for a string g amounts to asking if x happens to belong in the 
absolute elementary cylinder Exp{g) associated with g. We thus arrive at 
the conclusion that all that P can possibly do to arrive at a decision consists 
in asking questions of this form. Note that P has not got to ask whether x is 
in ExpE{g) since P already knows that x is in E. This is an important point 
since asking if x e ExpE^g) would be more computationally expensive. Thus, 
as long as we are dealing with words whose membership in E has already 
been ascertained, we can freely exchange the relativized cylinders ExpE{g), 
ExpEif),-- with the corresponding absolute cylinders Exp{g), Exp{f),.. . 

In our theory, the state of knowledge of a running program P at any stage 
during computation P{x) on input x consists of a pile of assertions, namely 
those that have been collected up to that computation stage. The assertions 
that are derived in one and same computation are certainly mutually consis- 
tent (since they assert properties pertaining to one and same object x). We 
are indebted to Dana Scott for this style of looking at computations. When, 
for some g G Ker{P), program P asks if x belongs in Exp{g) (more pedan- 
tically, when P performs calculations that amount to testing if x belongs in 
Exp{g)) this question always gets an answer. The answer can be a "yes" 
when the test is passed, or a "no" when it is failed. In case of a negative 
answer, the new piece of information "x is not in Exp{g)" is acquired on part 
of the program P. Since ExpE{g) is a subset of Exp{g), the more interesting 
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piece of information "x is not in Expsig)" can be inferred from "a; is not 
in Exp{g)." Thus, in this case the state of knowledge of program P gets en- 
larged by the addition of the new piece of information "a; is not in ExpE{g)" 
In case of success the new piece of information which is acquired is of course 
"x is in ExpE{g)" Since it is already known to P that x is in E, in this case 
the state of knowledge of program P gets enlarged by the addition of the new 
piece of information "x is in ExpE{g)" (Remarks on what program knows at 
various stages in a computation shall not be regarded as ventursome as they 
can be made formal using methods of model theoretic analysis of program 
knowledge [6].) 

In this theory, information regarding x is acquired by P in lumps. The 
acquisition of a piece of information occurs at the moment when the execu- 
tion of a sequence of tests is completed. We may well think of a piece of 
information as being a piece of paper carrying a written note such as "x is in 
ExpE{g)" or else "x fails to be in ExpE^g)" These notes stack one upon the 
other until the pile becomes a decisive one: This is the case when the data 
that was gathered entails one of the events x G F or x G i? — F. (It must be 
given credit to Scott for these conceptual contents of the theory.) 

Since Ker{P) is a subset of \LogE{F)\ which is a reduced set of strings, 
it cannot be that Ker{P) contains two substrings f,g with / included in 
g. Hence the cylinder associated with, say, / cannot include the cylinder 
associated with g. Thus, P never tests membership of input x into two 
cylinders one included in the other. 

It is instead possible that a cylinder ExpE{g) with g in Ker{P) will 
intersect one or more other such cylinders, and be completely included in the 
union of the intersected cylinders. This can happen when g is a. witness for 
a whole bundle of solutions, or (in somewhat more esoteric manner) when g 
is a wizard. The next two paragraphs have the details. 

Witnesses and wizards formally defined Assume that we are consid- 
ering an input word x of size n. Let be a string in the reduced logogram 
\LogE{F) \ of an NP problem {E, F) so that ExpE^g) is a subset of the target 
set F. We assume that the size of string g is less or equal to n, which is 
written \g\ < n, thus g actually belongs in \LogE{F"')\. 

Let us remember that, given any NP problem [E, F), there is a decom- 
position of the target set F into subsets Fj called solution regions, where Fi 
is the set of those words x in F that are satisfied by solution 

That being granted, we say that is a witness as soon as its associated 
relativized cylinder ExpE^g) is fully included in at least one of the F^s. We 
say that g is a proper witness as soon as its associated relativized cylinder 
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ExpE^g) is fully included in exactly one of the FjS. Equivalently, (? is a proper 
witness as soon as g is an encoded sign of satisfiability that points toward a 
unique solution yk- If Expsig) is included in the intersection of two or more 
of the FiS then g is an improper witness (also called a pseudowizard) . 

We say that is a wizard as soon as ExpE{g) fails to be fully included in a 
solution region. Thus, when is a wizard, its inclusion in a word x belonging 
to the reference set £" is a guarantee that x is satisfiable while nothing can 
be said about what particular solutions satisfy x. 

Immediate properties of the wizards Take g G \LogE{F)\, and assume 
that ExpE{g) fails to be fully included in any one single solution region F,. 
Then g is a wizard. To simplify things, assume that ExpE{g) is fully included 
in the union of two solution regions Fh and Fk- Then the situation is 

ExpEig) ^FhU Fk, ExpEig) 2 Fh, ExpE^g) ^ Fk. (13) 

Let us develop our remarks for some fixed echelon i?", where n > \g\. Let 
fi,..,fs be all the strings in U°'^^\LogE{F^)\ whose associated relativized 
cylinders intersect ExpE^g): 

ExpE{g) n ExpeUj) 7^ 0, J = 1, s (14) 

The reader will remember that \J^^i\LogE{F^) \ is the set of all the witnesses 
that exist in the reduced logogram \LogE{F'^)\. Thus, the strings /i, .., fs are 
exactly those witnesses in the reduced logogram \LogE{F'^) \ whose associated 
relativized cylinders happen to intersect ExpE{g)- 

The strings fi, ■■, fs will be referred to as the witnesses associated with g. 

We set ExpE^fj) = Cj all j = 1, .., s. Note that none of sets Ci, .., Cg is 
included in ExpE^g)- Indeed, should Ck be a subset of ExpE^g), we would 
have g < fk, a.n absurd since both g and fk are members of the reduced set 
\LogEiF-)\. 

We conclude this section proving a theorem. 

Theorem 8. If g is a wizard and fi, ..,fs are its associated witnesses, then 
ExpE^g) is properly included in the union of the corresponding cylinders 
Ci = ExpEifi), .., Cs = ExpEifs)- 

Proof. Let {E,F) be any NP problem with Fi,..,Fa[n) as solution regions 
associated with nth echelon {E"',F"'). 

It follows from Theorem 7 of [5] that the set Ufii^ \FogE{Fp)\ of all wit- 
nesses in the reduced logogram \LogE{F^)\ is complete for problem {E, F). 
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This implies that the union of the relativized cylinders associated with 
the strings in Ufii'* \LogE{FJ^)\ is coincident with the whole of the target set 
F" of problem {E^, F"). (In this proof we are reasoning echelon by echelon.) 

As consequence, the relativized cylinder Expsig) is coincident with the 
union of its intersections with cylinders Ci, .., Cs- 

Hence, ExpE^g) C Ci U C2 U .. U C,. 

By contradiction, let us assume that ExpE^g) = Ci U C2 U .. U C^. 
Then Ci U C2 U .. U Cg is an elementary relativized cylinder, and g is its 
signature. 

From ExpE{g) = Ci U C2 U .. U follows Cj C ExpE^g) all j = 1, .., s. 

Cj C ExpE^g) is rewritten E n Exp{fj) C E n Exp{g), and from this last 
inclusion follows Exp{fj) C Exp{g) all j = 1, .., s. 

Hence, g < fj all j = l,..,s. Absurd, since g and fj are members of 
\LogE{E"-)\, which is a reduced set of strings. □ 

We conclude that, if is a wizard, then, with notations given, (i) ExpE^g) 
is a proper subset of Ci U .. U C^, and (ii) Cj ^ ExpE^g) and ExpE^g) % Cj 
all j = 1, 

Note that (i)-(iii) is not sufficient to conclude that is a wizard. 

11 Structural Independence of Problems 

One of the themes of this paper is the study of finite collections of subsets 
of the target F of a decision problem {E, F). This study is related with our 
interest for notions of structural independence of decision problems. 

We have seen that both Ve{F) and J-'p{E,F) are collections of subsets 
of the target set F of problem {E,F), with J^p{E,F) being a subcollection 
of Ve{F). We also have seen that the elements of these collections are 
elementary relativized cylinder sets. We defined Ve{F) to be the collection 
of all sets of the form ExpE{g) where g G \LogE{F)\: Thus, there is one-one 
onto correspondence between the elementary relativized cylinders in Ve{F) 
and the strings in \LogE{F)\. J^p{E,F) is the subcollection of Ve{F) that 
is associated with Ker{P). 

Pairwise independence of strings Let f,g& Iloo{E) for the whole para- 
graph. We say that / entangles g relative to E as soon as all x G F which 
include / also include g (note that since f,g & Soo(F) there exists x E E 
which includes / and there exists y & E which includes g). In [5] we devel- 
oped a theory of entanglement among strings. That / entangles g relative to 
F was denoted / g. 
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/, g independent relative to E means that neither / □'^ g nor g /. 
Thus, /, g independent relative to E means that (i) there is x G -E which 
includes / and does not include g and (ii) there is y E E which includes g 
and does not include /. 

Note that / g if and only if ExpE^f) C ExpE^g)- Analogously for 
g /• Then f,g independent relative to E means that neither ExpE{f) C 
ExpE{g) nor ExpE^g) C ExpeU)- 

Internal independence of a problem We call problem (£", F) internally 

independent as soon as the strings in \LogE{F)\ are pairwise independent 
relative to E. Thus, {E,F) internally independent = T>e{F) is an antichain. 

Strong internal independence In our previous paper [S] we defined the 
property of strong internal independence of a decision problem. We defined 
problem {E, F) to have this property as soon as for any choice of s distinct 
strings /i, .., in \LogE{F)\, the following is true: For every i between 1 and 
s there exists a word Xi E E such that Xj contains /j and fails to contain any 
of the remaining strings in {fi, .., fg}. 

Theorem 9. Strong internal independence of a decision problem implies 
internal independence. 

Proof. By contradiction, assume that {E, F) exhibits the strong internal in- 
dependence property and does not exhibit the simple dependence property. 

Thus, not all of the strings in \LogE{F)\ are independent relative to E 
taken two by two. This is to say that there exist f,g E \LogE{F)\ such that 

ExpeU) ^ Expsig). 

With reference to the definition of the strong internal independence prop- 
erty we take /, as a particular choice for /i, .., fs- Since {E, F) has the strong 
internal independence property we have that (i) there exists x E E such that 
X > f and X ^ g, (ii) there exists y E E such that y > g and y ^ f. 

On the other side, since x > f and x E E, we have that x G Expsif)- 
By the contradiction hypothesis, we also have x G ExpE{g)- This implies 
X > g, an absurd. □ 

Complete internal independence We shall say that problem [E, F) has 
the complete internal independence property as soon as, given any finite 
set of pairwise compatible strings in \LogE{F)\, there exists x E E 

such that (i) x includes each of the fi, .., fs, (ii) x fails to include any other 
g G \LogE{F)\ except possibly those that are subsumed by /i + .. + /«. 
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In Section 9 we defined complete independence of a finite collection of sets 
in T>e{F). We may well rephrase our definition of complete independence of 
a decision problem as follows: (-E, F) is completely independent if any finite 
collection El,.., Eg of pairwise intersecting regions in T>e{F) is completely 
independent. 

12 Application to SAT 

The encoding scheme that we adopt converts CNF formulas into words over 
E = {0, 1, 2}. In what follows E = CNF, F = SAT. 

Wc represent clauses over Xi, .., x„ by sequences of n codes from E. Code 
denotes absence of the variable, code 1 presence without minus, code 2 
presence with minus. E.g., clause xi V X3 V — X4 becomes 1012. 

A whole formula is encoded as a sequence of clauses. We define F""* = 
satisfiable formulas with n variables and m clauses. 

Every encoded formula has a prefix of the form 0.. 010. .01 consisting of n 
Os followed by a 1 followed by m Os followed by a 1. A program P solving 
{E, F) learns the current values of n, m from this prefix. P shall have to be 
aware that the nm characters on the immediate right of the prefix encode m 
clauses over n boolean variables. 

The reference set E — CNF is thus encoded as a prefix- free language over 
S = {0, 1, 2}. Given any subset ACE, one has Expe{A) = Exp{A)r\E = A. 
Thus, any subset A C E is a cylinder relative to reference set E. This is 
to say that any A C E is a, closed set in E. Note that these simplifications 
fairly match with usual programming practice. Note that the target F of 
problem {E, F) is in any case a cylinder relative to E, as required by theory 
developed above. 

The size (or complexity) \x\ of a boolean formula x is the number of distinct 
variables that have occurrences in x. By the effective size of a satisfiable 
boolean formula x we understand the minimum number of value assignments 
to variables that are needed to evaluate the formula to 1. We define the 
effective size of an unsatisfiable formula to be the size of the formula. 

Example Consider formula x = {xiVx^V —X4)A{x2V —x^). The size is 4, 
the effective size is 2. Indeed, the two value assignments to variables Xi = 1 
and X2 = I are sufficient to set x to 1. This is not the only partial value 
assignment to variables that satisfies x. For example, the partial assignment 
X4 = 0, X3 = is also sufficient to set the value of x to 1. 

If size and effective size of a formula are different we say the formula is 
bewitched (a formula is unbewitched if the two numbers are equal). If x is 
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bewitched, then certainly x includes a pseudowizard. Let us verify this on for- 
mula X in the above example: The encoded version of x is 000010011bl2bl2b. 
It subsumes the strings 00001001 Ibbbblbb and 00001001bbb2bb2b that are both 
pseudowizards. (Remember that pseudowizards are witnesses.) 

The unbewitched formulas form a hardest subset of SAT, and we may 
ignore bewitched formulas without loss of generality. 

We introduce the sequence yi,y2, ■■ of solutions, and the corresponding se- 
quence Fi,F2,.. of recursive subsets of F. Here the solutions i/i consist 
of value assignments. The cardinality function is a{n) = 2". The target 
set F = SAT as well as the solution regions Fi,F2,.. are closed sets in 
E = CNF. Thus, all these sets are relative cylinders in E. These assump- 
tions correspond to properties of SAT that can be derived under various 
other encoding schemes [2] [7]. 

Before we prove our main result in this paper, let us spend a few words 
on the logogram of SAT. A string in \LogE{F"'''^)\ is a prescription that 
a word in F"™ may or may not be conformant with. We may represent a 
string in \LogE{F"'"^)\ as a word of length nm over {b} U S (ignoring prefix). 
Example for n = m = 3: String bbllb2b2b prescribes that first clause shall 
include X3, second shall include xi and — X3, third shall include —X2. Note 
that strings in \LogE{F"'"^)\ only prescribe either 1 or 2 as values (by the 
minimality property of reduced logogram). 

Theorem 10. Let E = CNF, F = SAT. Problem (E, F) exhibits the com- 
plete internal independence property. 

Outline of proof. We prove that, given s distinct, pairwise compatible 
strings /i, .., fs from the reduced logogram \LogE{F"'"^)\, there is x & E such 
that (i) X > /i + /2 + .. + fs, (ii) for all of the remaining g G iLogEiF"^"^)] 
one has x > g if and only if g < fi + f2 + ■■ + fs- 

Proof. We consider s distinct strings fi, ■■, fs taken from \LogE{F"^^"^)\. Thus, 
regarded as a partial function, each fi will assign only values 1 or 2. We 
assume that fi, .., fs are compatible taken two by two. 

We shall rely upon the main result proven in [5] namely that SAT has 
no wizards. By virtue of this result, none of the strings /i, .., /s is a wizard. 
That means that each of them is a witness, which in turn means that each 
of them consists of a consistent prescription of exactly one literal to each 
of the m clauses. That a prescription is consistent means that it does not 
comprise (i) an assignment of a literal to a clause and (ii) the assignment of 
the negative of that literal to another clause. 

Let us consider any two of the strings fi, .., fs, call them f^ and fk- 
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Since fh and fk are compatible, we cannot find two distinct integers 
1 < i < n,l < j < m, such that (a) fh assigns hteral Xi to clause j, (b) fk 
assigns literal —Xi to same clause j. 

Since that holds for any pair fh, fk, we take it for granted that, if one 
of the strings fi, .., fs prescribes a literal to one of the clauses, none of the 
remaining strings within fi, .., fs will prescribe the negative of that literal to 
the same clause. 

That being granted, we define x to be that uniquely identified formula 
that has a literal in a clause if and only if at least one of the strings fi, .., fs 
prescribes that literal to that clause. 

Formula x subsumes everyone of the strings /i, ..,/s, hence x subsumes 
their join fi + f2 + ■■ + fs- 

Let g e I Log's (F""*) I and assume x > g. 

By way of contradiction, assume that fi + ■■ + fs ^ g- We write / for 
fi + .. + fs so that we have Dom{f) = Dom{fi)U..LlDom{fs). The hypothesis 
taken by contradiction is rewritten f ^ g. 

Prom our definitions about strings, we have that f > g ii and only if 
Dom{g) C Dom{f) and, for any i e Dom{g), f{i) — g{i). Then, since 
f ^ g. only the following two cases are possible: 

Case 1. Dom{g) ^ Dom{f). 

In this case, there exists k G Dom{g) such that k ^ Dom{f). This last is 
rewritten k ^ Dom{fi), .., k ^ Dom{fs). 

We made the assumption x > g. Since g prescribes only 1 or 2 as values, 
this assumption implies x{i) ^ for all i e Dom{g). In particular, we have 
x{k) 7^ 0. This is absurd since, by the above construction of formula x, we 
have x ^ only in Dom{f). 

Case 2. Dom{g) C Dom{f) but there is an argument value i e Dom{g) 
where f{i) ^ g{i). 

Since Dom{g) C Dom{f), we certainly have i e Dom{fi) or 2 G Dom{f2) 
or., or 2 G Dom{fs). With no loss of generality, we assume i G Dom[fi). 
We then have g{i) ^ fii})- 

However, since x > g and i G Dom{g), we also have x{i) = g{i). Then 
x{i) 7^ fi{i), a contradiction since f — fi everywhere in Dom{fi). □ 

13 On the Time Complexity of SAT 

In this section we set forth a hint of proof of the exponentiality of SAT in 
the light of Theorem 10 above together with Theorems 8, 9, 10 of companion 
paper. Remember that E = CNF, F = SAT. We conjecture that a formal 
proof can be derived following this hint. 
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Theorem 11. SAT is exponential. 

Our hint of proof consists of two parts, (I) and (II). 

(I) It follows from Theorem 10 of [5j that there is a unique subfamily JF of 
Ve{F) such that F = IJ-^, namely JF = Ve{F) itself. As a consequence, for 
any proper subfamily JF c T>e{F) one has F 7^ IJ JF. 

We then have that it cannot be that J-'p{E,F) is a proper subfamily of 
the full cover Ve{F), otherwise we would have F 7^ \J J^p{E, F), and then 
P could not be correct as a program. In particular, since the cardinality of 
T>e{F) grows exponentially with word size, we have that Tp{E,F) is not 
allowed to be a polynomial subfamily of T>e{F). Thus, no search algorithm 
for SAT can only search a polynomial family of sets. 

(II) It remains for us to discuss the possibility that one single algorithm can 
solve the full search problem for x by directly searching the full exponential 
family T>e{F) in polynomial time. However this can scarcely be the case 
due to complete absence of any form of dependence among subsets in the 
reduced logogram \LogE{F)\ for E = CNF, F = SAT. By this lack of 
internal dependence, any computation of a program P solving [E, F) is such 
that the result of any computation step does not change the results that are 
left possible for the subsequent steps. In the rest of this part we make a few 
informal remarks on how this lack of dependence comes into play. 

We take a general purpose program machine M as computation model. 
(That M is a program machine means that the process carried out by M 
is determined by a running program.) We assume that only one program 
is running at any moment of time within M. We keep machine M fixed 
while we consider an infinite set of programs solving SAT (actually the set 
of all programs that run on M and solve SAT). We emphasize that the 
hardware is kept fixed while different programs all running on that hardware 
are compared. 

Let B{x, k) be a program running on machine M which for any given 
input X G -E of size n and every integer k between 1 and 2^ will decide if 
X has solutions in the range between yi and yk- Take TimeB{x, k) be the 
number of time units that algorithm B uses on inputs x, k on machine M. 

(Ila) We consider a{n) = 2^ strings from reduced logogram \LogE{F^"^)\ 
call them /i, .., fain)- Since SAT has no wizards, each of these strings is a 
witness. By way of consequence, each / within /i, .., fa(n) is associated with 
one bundle of solutions (one single solution when / is a proper witness). 

With no loss of generality, we may choose the strings /i,..,/a(„) to be 
all proper witnesses. Besides, we may well choose the strings so that, for 
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j = 1, ..,a{n), the unique solution associated with fj is yj. Note that we do 
not assume that the strings /i, .., fa{n) are pairwise compatible. 

By the strong internal independence of SAT, for every j = l,..,a{n) 
there exists a word x G F"^™ which includes string fj and fails to include 
all of the remaining strings in /i, .., fa{n)- Thus, x is satisfied by solution yj. 
Nothing can be said of the remaining solutions since the fact that x fails to 
include /j where i j does not exclude that x can possibly include other 
witnesses associated with solution t/j. 

However, in the above argument we may keep fj fixed while varying, 
for all i = 1, .., a{n),i ^ j, the string over \LogE{F™'-)\ in any possible 
manner. We then conclude that, for any single j between 1 and 2", there 
exists an a; e F""* which is satisfied by yj and fails to be satisfied by any 
other solutions yj 7^ yj. 

In particular, given any integer k such that 1 < /c < 2", there exists 
X G F""* which is satisfied by solution yt+i but is not satisfied by any one of 
the solutions in the range from yi and yk- Thus, it may well be the case that 
B{x, k) = and B{x, A; + 1) = 1. This statement is a prerequisite for what 
follows. 

(The above argument may seem to us very obvious in the light of our 
empirical understanding of SAT. The internal independence properties of 
this problem put this on deductive bases.) 

{IIP) We will prove the following. 

If for any unsatisfiable x G E"-"^ and any A; < 2" we have TimeB{x, k) — 
TimeB{x, k + 1), then there exists another program A^ solving SAT on hard- 
ware M such that 



Indeed, under the above hypotheses on M, we can speak of the class of all 
programs B, C,.. that solve SAT on machine M. Given any input y of size 
n, we can then introduce a most efficient program Ay for particular input 
y in this class. We understand that Ay is a most efficient program for a 
particular input y as soon as Ay solves SAT on machine M and, besides, 
Tim.eAy{y,2'^) < Timec{y,2"') for any other program C solving SAT on 
hardware M. Thus Ay is no worse than any other C solving SAT on M 
limited to this particular input y. 

We shall prove that, for an unsatisfiable input x, we have TimeA^ix, i) < 
TimeA,{x, z + 1) alH, 1 < i < 2". 

If we ignore improper witnesses (which is to say that we disregard be- 
witched formulas) we have: 



TimeA^ix, k) < TimeA^{x, k + 1) 
TimcAS^X) < TimeB{xX)- 



(15) 
(16) 
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\LogE{F,+i)\n[J \LogE{F,)\ 







(17) 



From Theorem 10 of present paper follows 



[j\LogEiF,)\^''\LogE{F,+i)\ 



(18) 



\LogEiF,+^)\^^ [j \LogEiF,)\ 



(19) 



(For brevity, in this informal hint we omit the proof that Equations 18, 19 
follow from the complete independence of SAT.) 

Thus, the two sets of strings \LogE{Fi+i)\ and Uj=i \LogE{Fj) \ are disjoint 
and there is no entanglement between them. 

Since is optimal on input x, algorithm Ax{x,i) may perform on x 
calculations that implement the search of word x for strings in \LogE{Fi+i)\ 
only if these calculations can be performed within the same set of machine 
cycles that are allocated to the calculations that implement the search of 
word X for strings in U}=i \FogE{Fj)\. 

For this overlap to be an affordable statement it is required that the 
first search (the one for strings in \LogE{Fi^i)\) should be implemented by 
the same set of calculations that implement the second search (the one for 
strings in U}=i \LogEiFj)\) so that, while performing the second search, also 
the first would be implicitly executed. By theory developed in [5], this kind of 
magics can actually occur through the mechanism of entanglement between 
sets of strings. That mechanism allows for different computation tasks to 
be performed by one and same calculation: It is well possible that one and 
the same computation implements distinct operations of search, acting upon 
distinct sets of strings. (In present theory computations occur as searches 
for strings in disguise: The calculations that a decision program P performs 
actually implement operations of search of an input word x for certain strings 
in the kernel of P.) 

However, from Equations 17-19 it follows that the two sets of strings 
U}=i \LogE{Fj)\ and \LogE{Fi+i)\ are disjoint and there is no entanglement 
between them. By this lack of entanglement, any overlap between an op- 
eration of search for strings in \LogE{Fi^i)\ and an operation of search in 
U}=i \LogE{Fj)\ is excluded. 

By virtue of (i) this lack of entanglement, together with (ii) the as- 
sumed optimality of the search performed by algorithm Ax{x,i) within set 
lj}=i \LogE{Fj)\, we have that the computation that algorithm Ax{x,i + 1) 
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performs cannot take less time than the sum of TimeA^ (a;, i) plus the min- 
imum time needed to search x for strings in |Log'E(-^i+i)|- Since this latter 
time cannot be 0, for all those x e which are not satisfied by solutions 
in the range from yi to ^j+i we have TimeA^{x, i) < TimeA^{x, i + 1). Hence 
Equation 15 certainly holds for all unsatisfiable x G i^"™ and all integers k 
such that 1 < /c < 2". Equation 16 holds by definition of algorithm A^.. 

We conclude that an unsatisfiable x requires exponential time on hard- 
ware M. 

The step from the above informal hint to a possible formal proof is cer- 
tainly not immediate. We should at least be able to prove that our model of 
computation M solves in polynomial time all and only those decision prob- 
lems that are polynomial on Turing machines. Besides, the theory based on 
the new model should provide some formal notion of independence between 
computations. 

14 Conclusions 

We advocated strings as a fundamental notion for studies on computation. 
Strings are useful to express the notions of simple, strong, and complete in- 
ternal independence of a decision problem. We have been led to use strings 
to become able to define the very basic notion of internal independence of a 
decision problem. Strings seem to be useful since they are absolutely elemen- 
tary. Strings are already at work in Computability. The "restrictions" that 
are often used in the study of circuit complexity are almost one and same 
notion as strings. (It seems to us that "string" is the correct name.) 

By way of curiosity: Our strings seem to be (from a layman's view) like 
sampled and discretised versions of the "strings" that the physicists use in 
their "string theories." 

Strings are not made of consecutive letters. A string can be interspersed 
in a word: By canceling zero or more letters in a word .x, and leaving blanks 
in places of letters, we get a string / which is a substring of the original 
word X (the pun is innocuous: There is no danger in saying that string / 
is a substring of x). In strings, one has information associated with spaces 
between letters (and hence with multiple periodicities with which letters may 
occur in long words). 

As soon as we have the strings, we are able to define the kernel of a 
decision program P, noted Ker{P). This is a set of strings which capture 
structural features of both P and the decision problem [E, F) that P solves. 

The program kernel Ker{P) is a subset of \LogE{F)\, the reduced lo- 
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gogram of the target set F in base E. The reduced logogram consists of 
substrings of the words in F which exhibit the following property : If a 
word in E includes one of these strings then it belongs to F. We may think 
of the strings in \LogE{F)\ as kind of genes of the words in F. In early 
notes the logogram was the genie of problem {E, F). The idea clearly comes 
from biology, where it is known that certain occurrences at given intervals of 
certain letters within DNA strings convey structural information, and yield 
observable characters in the macroscopic development of the structures. 

Our application to SAT uses structural properties of that problem that 
seems to have escaped attention so far. We called them "strong internal 
independence" and "complete internal independence." In companion paper 
we showed that SAT exhibits the strong internal independence property. In 
that paper we have shown that, by that property, SAT cannot have collec- 
tive certificates in its reduced logogram. This was our main result in that 
paper. Starting from that result, we proved in this paper that SAT exhibits 
a stronger form of structural independence that we called "complete internal 
independence." 

It seems to us that SAT is difficult due to this extreme form of internal 
independence: On unsatisfiable inputs, any program solving SAT has expo- 
nential worst-case complexity. Our arguments for this conclusion use (i) the 
complete independence of SAT together with (ii) absence of wizards, that 
we proved in previous paper. As we are not able, so far, to produce a strictly 
formal proof (for lack, by far, of a suitable model of computation), we just 
outlined the ideas of a proof that would use properties (i), (ii). We conjec- 
ture that SAT can be proved exponential using the complete independence 
of SAT together with absence of wizards. We expect that a proof that uses 
these properties will appear in short time, possibly following the lines of our 
hint of proof for Theorem 11. 

Various relevant candidate proofs of P 7^ NP have been set forth in recent 
years. For some, we do not even have as yet detailed arguments that they fail. 
A common feature of these efforts is that they try to derive P 7^ NP from 
properties that are known since decades, or use portions of theory (especially 
one-way functions) that are known since much time. A peculiar feature of 
our research is that it comes together with a completely new theory, that 
has applications in diverse fields, and definitely cannot be compressed in few 
pages. Besides, the property of SAT that is used is completely new in studies 
on computation. 
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